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We present a derivation of Born’s rule and unitary transforms in Quantum Mechanics, from a 
simple set of axioms built upon a physical phenomenology of quantization. Combined to Gleason’s 
theorem, this approach naturally leads to the usual quantum formalism, within a new conceptual 
framework that is discussed heuristically in details. The structure of Quantum Mechanics, from its 
probabilistic nature to its mathematical expression, appears as a result of the interplay between the 
quantized number of “modalities” accessible to a quantum system, and the continuum of “contexts” 
that are required to define these modalities. 


1. Introduction. 

Deriving Born’s rule, rather than postulating it as it is 
done in standard textbooks, has been envisioned since the 
early times of Quantum Mechanics (QM) [I|. A major 
asset in this direction is Gleason’s theorem [^, whose 
critical importance for the foundations of QM has been 
recognized since it was published in 1957. The theorem is 
simple to state (see below), but difficult to demonstrate, 
and a nice presentation is provided in 

The main attempts to use Gleason’s theorem for deriv¬ 
ing Born’s rule, and then the whole quantum formalism, 
have been done in the framework of formal quantum logic 
[^. However, such approaches were not considered very 
appealing by physicists, and though Gleason’s theorem 
essentially gives the correct quantum probability law, it 
was often said that it provides no physical insight into 
why the result should be regarded as probabilities. Ac¬ 
cording to it is even considered as a motivation to 
seek a more physically transparent derivation of Born’s 
rule. This is partly because the hypothesis of Gleason’s 
theorem do not fit easily in the usual “wave function” 
approach of QM, and in particular within the superposi¬ 
tion principle, which is usually put forward as the very 
first postulate when introducing quantum mechanics. 

In this paper, we will introduce new axioms for QM 
[m, starting with three physical axioms defining quan¬ 
tum rules, without any mathematical formalism. When 
completed by a fourth mathematical axiom, it will turn 
out that the four together correspond to the hypothesis 
of Gleason’s theorem, leading straightforwardly to Born’s 
rule. Before stating this fourth axiom, we will introduce 
it heuristically, by using the three physical ones, com¬ 
pleted by a set of physical assumptions. So let us start 
with the following set of physically motivated axioms, 
which have been introduced and discussed in [iM3: 

• Axiom 1 (quantum ontology): Given a physical 
system, a modality is defined as the values of a 
complete set of physical quantities that can be pre¬ 
dicted with certainty and measured repeatedly on 
this system. The complete set of physical quantities 
is called a “context”, and the modality is attributed 
jointly to the system and the context. 


• Axiom 2 (quantization): For a given context, that 
is a given “knob settings” of the measurement ap¬ 
paratus, there exist N distinguishable modalities 
{ui}, that are mutually exclusive : if one is true, 
or verified, the other ones are wrong, or not ver¬ 
ified. The value of N, called the dimension, is a 
characteristic property of a given quantum system. 

• Axiom 3 (changing contexts): The different con¬ 
texts relative to a given quantum system are related 
between themselves by (classical) transformations g 
that have the structure of a continuous group Q. 

For the sake of clarity, we note that, within the usual 
QM formalism (not used so far), a modality and a context 
correspond respectively to a pure quantum state, and to 
a complete set of commuting observables. The axioms 
are formulated for a finite N, but this restriction will be 
lifted below. These axioms, under the acronym “GSM”, 
meaning Context, System, Modality, have been discussed 
in both physically and philosophically, and we will 
not reproduce this discussion here. We will rather con¬ 
sider the following question: it is postulated in Axiom 
2 that there are N mutually exclusive modalities asso¬ 
ciated to each given context, but there are many more 
modalities, corresponding to all possible contexts. These 
modalities are generally not mutually exclusive, but are 
incompatible: it means that if one is true, one cannot 
tell whether the other one is true or wrong. Then, how 
to relate between themselves all these modalities ? 

A first crucial result already established in is that 
this connection can only be a probabilistic one, otherwise 
the axioms would be violated; the argument is as follows. 
Let us consider a single system, two different contexts Cu 
and Cy, and the associated modalities Ui and vj, where 
i and j go from 1 to N. The quantization principle (Ax¬ 
iom 2) forbids to gather all the modalities Ui and Vj in 
a single set of 2N mutually exclusive modalities, since 
their number is bounded to N. Therefore the only rel¬ 
evant question to be answered by the theory is: If the 
initial modality is Ui in context C„, what is the condi¬ 
tional probability for obtaining modality Vj when the con¬ 
text is changed from Cu to C„ ? We emphasize that this 
probabilistic description is the unavoidable consequence 
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of the impossibility to define a unique context making 
all modalities mutually exclusive, as it would be done in 
classical physics. It appears therefore as a joint conse¬ 
quence of the above Axioms 1 and 2, i.e. that modalities 
are quantized, and require a context to be defined. 

Now, according to Axiom 3, changing the context re¬ 
sults from changing the measurement apparatus at the 
macroscopic level, that is, “turning knobs”. A typical 
example is changing the orientations of a Stern-Gerlach 
magnet. These context transformations have the mathe¬ 
matical structure of a continuous group, denoted G ■ the 
combination of several transformations is associative and 
gives a new transformation, there is a neutral element 
(the identity), and each transformation has an inverse. 
Generally this group is not commutative : for instance, 
the three-dimentional rotations associated with the ori¬ 
entations of a Stern-Gerlach magnet do not commute. 
For a given context, there is a given set of N mutually 
exclusive modalities, denoted {ui}. By changing the con¬ 
text, one obtains N other mutually exclusive modalities, 
denoted {u^}, and one needs to build up a mathematical 
formalism, able to provide the probability that a given 
initial modality {ui} ends up in a new modality {vj}. 

The standard approach at this point is to postulate 
that each modality Ui is associated with a vector \ui) 
in a A^-dimensional Hilbert space, and that the set of 
N mutually exclusive modalities in a given context is 
associated to a set of N orthonormal vectors. Rather 
than vectors \ui) and \vj), one can equivalently use rank- 
one projectors Pm and Py. , and Born’s rule giving the 
conditional probability p{vj\ui) can be written as 

p(yj\ui) = Tra.ce{Pu.Py.). (1) 

As we will show below, after postulating that each 
modality is associated to an Hermitian projector acting 
in a suitable Hilbert space, there is actually no need to 
postulate also Born’s rule: it follows immediately as a 
consequence of Gleason’s theorem, and the transforma¬ 
tion of projectors associated with a change of context 
must be unitary. Before doing that, we shall first justify 
heuristically why each modality should be associated to 
a projector. This will be done in Section 2, then we will 
come back to Gleason’s theorem in Section 3, and finally 
discuss some consequences of our approach in Section 4. 

2. Heuristics without Gleason’s theorem. 

The main goal of this heuristics is to give a justifi¬ 
cation for Axiom 4 (given in Section 3 below), telling 
that each modality is associated to an Hermitian projec¬ 
tor acting in a suitable Hilbert space. For this purpose, 
we will start from Axioms 1-3 only, and introduce a 
set of assumptions to construct a consistent probability 
theory, by imposing some requirements on what it should 
describe. This will lead us to associate modalities with 
projectors in an Hilbert space, and to get Born’s rule 
and unitary transforms on the way. The more formal 
proofs will be given in Section 3 by using Axiom 4. 


The general probability matrix 

Since the {ui} and {u^} are by definition non-exclusive 
modalities, one has to introduce the probabilities of find¬ 
ing the particular modality Vj (in the new context), when 
one starts in modality Ui (in the old context). There are 
N'^ such probabilities, that can be arranged in a matrix 
n^lu = {Pvj\ui), containing all probabilities connecting 
the N modalities in each context {ui} and {vj}. Since 
one has obviously 0 < Pv^im ^ 1 and Sj Pvj\ui = li the 
matrix n„|u is said to be a stochastic matrix. 

For clarity, let us emphasize the interpretation of the 
conditional probability notation: in agreement with the 
definition of modalities as certainties, the meaning of 
Pvj\ui is that “if we start (with certainty) from modal¬ 
ity Ui in the old context, then the probability to get 
modality Vj in the new context is The matrix 

of all Pvj\ui provides the starting point for our heuristic 
approach, by which theoretical predictions are connected 
to experiments. For TV = 3, one will have for instance 

( Pv-i\u\-i Pv2\u\'! Pv2\u\ 1 Pv-i\u\ Pv2\u-1 \ 

Pv-i\u2-i Pv2\u2'! Pv2\u2 ^ Pv-i\u2 Pv2\u2 I 

Pvx\u3-: Pv2\u3': Pv3\u3 1 Pvx\u3 Pv2\u3 ) 

As we will see below, TV > 3 is required because some 
crucial properties of n^|„ do not show up for TV = 2. 

Let us also define a “return” probability matrix n„|^, 
by exchanging the roles of the initial and final contexts. 
The matrix n„|„ has the same properties as n„|„, but 
these two matrices are a priori unrelated, whereas it is 
well known that in standard QM, they are transpose of 
each other. In the following, we will introduce simple 
assumptions which will constraint these matrices to 
being unistochastic, i.e. that their coefficients are the 
square moduli of the coefficients of a unitary matrix 
[l^ : and then, to being transpose of each other. 


A mathematical identity 

In order to manipulate the n„|„ and n„|„ matrices, it 
is convenient to introduce orthogonal (TV x TV) projectors 
Pi, that are zero everywhere, except for the term on 
the diagonal that is equal to I. These projectors verify 
the relation PiPj = PiSij. A useful operation is then 
to extract the particular probability Pv^lut from n„|„, or 
Pui\vj from n„|„, and one has the following identities : 

Pvj\ui = Tr(^i Pi Sii|u) = Pj s|^|^) (2) 

Pui\vj = Tr{Pi Pj 'P‘u\v) = Pi S]^|^) (3) 

where Tr is the Trace, ^ is the Hermitian conjugate, and 




( 4 ) 


are TV x TV matrices formed by square roots of the prob¬ 
abilities, and by arbitrary phase factors which are intro¬ 
duced for the sake of generality, and don’t play any role at 
that stage. We emphasize that the equations above are 
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only mathematical identities, and don’t tell more than 
what is already contained in the definition of the matri¬ 
ces and n„|^. A useful marginal case is the situation 
where the context is not changed, so u = v, and 

Puj\ui = T^iPj Pi) = ( 5 ) 

where Puj\ui = is obviously consistent with mutually 
exclusive modalities within a given context. 

From Eqs. [H [3]the elements Pji of a general stochastic 
matrix 11 can be written as (the subcripts u\v or v\u are 
omitted for simplicity): 

p,, = Tr(P,SP,St). (6) 

Now, according to the singular values theorem, there 
must exist two unitary matrices U and V, and a real 
diagonal matrix R, such that 

i: = URV\ = (7) 

where the diagonal values of R are the square roots of 
the (real) eigenvalues of equal to those of and 
are called the singular values of E [l^ . The matrix E is 
unitary iff R is the identity matrix 1. We note that the 
value of Tr(i?^) is the sum of the square moduli of all 
the coefficients of E, and is therefore equal to N. For a 
generic stochastic matrix If, EE^ has diagonal coefficients 
equal to 1, but is not diagonal, whereas R^ is diagonal, 
and its N coefficients are real, positive, and sum to W, 
but are not necessarily equal to one. 

Using Eqs. (EHZl), Pji can now be written: 

Pji = Tr{P, URV^ Pj VRU^i) 

= Tr ((U^'Pi U) R {V^PjV) R) (8) 

This equation is again a mathematical identity, on which 
we shall now impose physical constraints. In the section 
below we will consider E„|„, but obviously the same 
arguments are also valid for E„|„. 

Physical constraints on the probability matrix. 

Given Axioms 1 and 2, our main physical argument 
is that the probability Pvj\ui should only depend on the 
particular modalities Ui and Vj being considered, and not 
on the whole contexts in which they are embedded. This 
important property of “non-contextuality” for modali¬ 
ties [ 2 ^ is related to Gleason’s theorem, and it will ap¬ 
pear again in Section 3. It tells that the same modal¬ 
ity can pertain to different contexts, and therefore can 
be defined (in particular, mathematically) independently 
of other modalities in a given context. This (quantum) 
non-contextuality is fully compatible with contextual ob¬ 
jectivity [l5l - [T7j : the latter states that a modality needs 
a context to be defined, whereas the former tells that the 
same modality can be defined in several contexts. 

In order to fulfill this condition, the decomposition of 
Eq. ([5]) suggests that it might be possible to separate two 
parts (within parenthesis) associated with the two spe¬ 
cific modalities Ui and Vj. However, if the singular values 


of the matrix E„|„ are not equal to 1, the matrix R ^ 1 
will impose a context-dependent structure on the whole 
sets of modalities {ui} and {uj}. There is nevertheless a 
way to warrant that R does not depend on E„|„, still sat¬ 
isfying the constraints spelled out above: it is to impose 
that R = \. Therefore, in order to have the probability 
depending on separate mathematical objects associated 
with each modality, we will posit the basic assumption: 

• Assumption 1: In order to ensure that Pvj\ui de¬ 
pends only on the two particular modalities being 
considered, the N singular values of E„|„ must be 
all equal together, and thus are all equal to one. 

Then as said above Ei,|„ will be a unitary matrix UV\ 
but one may wonder whether orthogonal (real) matrices 
might be enough. In order to justify that the full unitary 
set is required, we shall use a second assumption: 

• Assumption 2: Since the change of contexts corre¬ 
sponds to a continuous group (Axiom 3), the set of 
matrices E„|„ must be connected in a topological 
sense, and must contain the identity matrix. 

Then it is known that the set of orthogonal matrices is 
topologically disconnected in two parts with determinant 
-|-1 and —1, which contradicts the above assumption. 
For instance, permutation matrices are not connected 
to the identity, whereas they correspond simply to 
“relabelling” the modalities, i.e. to a trivial change 
of context. On the other hand, all (complex) unitary 
matrices are connected to the identity, and do agree with 
Assumption 2 (for other arguments see refs. 

Unitary matrices and Born’s formula 

We are thus lead to the conclusion that E„|„ must be 
a unitary matrix Siiiu, with 5^1^ = Then Eqs. m 

for picking up a particular probability become: 

Pv^\Ui = T^iPj ■ sl^u ■ ■ ^v\u) 

= Tr(P,.5,|„.P,.^^|J (9) 

which shows that the matrix n„|„ must be unistochastic, 
i.e. made by the square modulus of the coefficients of a 
unitary matrix. Such matrices are also bistochastic, i.e. 
their lines and rows sum to 1 [ 2 ^. Then we can define 

~ I Pj = ^v\u ■ Pj ■ ■ (10) 

It is clear that these operators are all Hermitian projec¬ 
tors, i.e. one has P^ = P and P^ = P for each of them, 
and also that all sets {P/} and {P"} have the same or¬ 
thogonality properties as the initial set of projectors {Pi}, 
given by Eq. ®. One can thus rewrite Eq. m as: 

= Tr{P, P') = Tr(P, P")- (H) 

This is just Born’s formula (Eq. [T|), which is obtained 
here heuristically, rather than from a postulate. 
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Finally, the obvious next step is to associate projectors 
with modalities in each context, and for the matrix 
it can be done in two consistent ways as seen above: 

old contextjMi} —>■ new contextjuj} (12) 

Pl= Sl^u-P^-Sv\u ^ P, 

Pi —t Pj = Sy\u ■ Pj ■ 

One can now come back to the matrix n„|„, for which 
the same reasoning is valid, and leads to a unitary matrix 
By reverting the contexts one has thus: 

old context{?;j} —>■ new contextjMi} (13) 
^ 

P, ^ Q'= 

But since projectors are now associated with modalities, 
they should be the same for a given modality in a given 
context, i.e. one should have P" = Q", and P/ = Q'. 
This is obtained if is the inverse of leading to 
a third assumption: 

• Assumptions: In order to associate projectors with 
modalities in a consistent way, the matrices n„|^ 

and n„|„ must be related by 
and thus n„|„ = n*|„. 

Then the various point of views represented in the rela¬ 
tions (mm are all consistent and give the same values 
for the probabilities, because each S'„|„ can be associ¬ 
ated to an element of the group of context transforma- 
tions G, and its inverse is S'„|„ = For the 

general consistency of the approach, this set of matrices 
gives a, N X N (projective) representation of the group of 
context transformations; this is fully consistent with the 
well known Wigner theorem [25l |. This continuous uni¬ 
tary evolution will be essential to describe the evolution 
of the system (translation in time), and it is also related 
to Theorem 2 in Sec. 3 below. 

The identification of the matrix as being a unitary 
matrix and the association of projectors to modal¬ 
ities, are the results we were looking for; in the next 
section they will be restated as Axiom 4 in our frame¬ 
work. In the above heuristic calculation, valid in the 
finite-dimentional case, they appear to be a joint conse¬ 
quence of the three assumptions made above, and of the 
mathematical identity given by Eqs. da ED- By starting 
from Axiom 4, the Trace formula used in this identity 
will turn out to be the only possible choice. 

We also obtained Born’s formula, apparently avoiding 
the heavy machinery of Gleason’s theorem, because we 
use the tools of linear algebra applied to real or complex 
N X N matrices, where all the required mathematical 
properties are already embedded. In order to obtain 
a full mathematical proof, we will now formally state 
Axiom 4, and deduce Born’s rule in the general case. 


3. Born’s rule from Gleason’s theorem. 

Here we add explicitly a fourth axiom, which associates 
modalities with projectors in a Hilbert space. Then we 
will demonstrate two theorems, which are respectively 
Born’s rule, and the unitary evolution of projectors. The 
axiom and the theorems are as follows. 

• Axiom 4 : For a system with dimension N, each 
modality is bijectively associated with a N x N 
Hermitian rank-one projector Pi [Pj = P^ = Pi). 
Each set of N modalities within a given context is 
associated to a set of N such projectors, verifying 
PiPj = Pi 6ij, and J2i Pi = 1- The same projector 
(and therefore the same modality) may be part of 
different contexts. 

• Theorem 1 (Born’s formula): If the system is 
known to be in the modality Ui from the set {ui}, 
the probability that it is found in modality vj from 
the set {vj} corresponding to another context ob¬ 
tained by the context transformation is: 

-Tr(P,.P;) (14) 

where Pi and P) are respectively associated to the 
modalities Ui and vj. 

• Theorem 2 (unitary transforms): The different 
sets of projectors corresponding to different con¬ 
texts are related by unitary transformations S'„|„, 
i.e. one has P' = S'„|„ . Pj . 5^1^. 

Proof. Let us first remind Gleason’s theorem ii : 

Let f{Pi) be a function of rank-one projectors Pi in a 
real or eomplex Hilbert space with a dimension larger than 
2, to the interval [0,1] of real numbers. Let assume that 
'Yhif^Pi) — 1 {Pi} of mutually orthogonal 

projectors (PiPj = Pi Sij) verifying J^iPi — Then 
there is a unique positive Hermitian operator p with unit 
trace so that f{Pi) = Tr(pPi) for all Pi. 

According to Axiom 4, each modality is bijectively as¬ 
sociated to an Hermitian rank-one projector Pi, and any 
context is associated to a set of N mutually orthogonal 
projectors {Pi} verifying ^ ■ Pi = Ld. Let us start from a 
context {Pi}, and go to another context {P}}, which may 
actually be the same as {Pi}. Since one has necessarily 

p{Pj\Pi) = 1 for any set {P}}, there exist a unique p 
such that p{P'\Pi) = Tr(Pjp). 

In addition, one may choose P' = Pi, and then (since p 
is unique) p{Pi\Pi) — Tr{Pip) = 1. This is possible only 
if p = Pi, and we obtain the expected Born’s formula 
p{P'\Pi) = Tr{P'Pi). This proves Theorem 1. 

In addition, since {Pi} and {P}} are sets of projectors 
onto two orthonormal basis (Axiom 4), there is a unitary 
transform Sj^i such that P[ = S'jp . Pj . up to some 
relabelling of the basis. This proves Theorem 2. 
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Let us note that the main hypothesis for Gleason’s 
theorem, i.e. that probabilities sum to one for any set 
of mutually orthogonal projectors summing to identity, 
is a joint consequence of Axiom 2, i.e. that there are N 
mutually exclusive modalities in each given context, and 
Axiom 4, i.e. that mutually orthogonal projectors are 
associated to these modalities. This remark allows us to 
lift the restriction on a finite value of N: since Gleason’s 
theorem is valid in any dimension. Axioms 2 and 4 can 
also be considered valid for any N [2^. This means also 
that the (classical) additivity of probabilities can be used 
within a given context 

Another more implicit hypothesis is that f{Pi) de¬ 
pends only on Pi, and not on other (orthogonal) Pj^i 
within the given set in Pj = Id] this property is 
usually called “non-contextuality” [2^, and we already 
introduced it as Assumption 1. It means that, given an 
initial modality, the conditional probability depends on 
the particular outcome modality considered, and not 
on other modalities within a given outcome context. 
Though this hypothesis may be considered very strong 
[^ . it fits perfectly with our “objective” definition of 
modalities [isl - fl^ : though a modality needs a context to 
be observed, the same modality may appear in different 
contexts, always associated with the same projector [^ . 
Therefore the physical Axioms 1-3, complemented by 
the mathematical Axiom 4, do allow us to deduce Born’s 
rule from Gleason’s theorem. 


4. Discussion 

Since we have now reached the starting point of most 
QM textbooks [1^, it should be clear that the standard 
structure of QM can be obtained from the above axioms 
31j . In particular, one can associate the N orthogo¬ 
nal projectors {Pi} to the N orthonormal vectors which 
are eigenstates of these projectors up to a phase factor, 
i.e., to rays in the Hilbert space. Similarly, the expected 
probability law for the measurement results {ci} will be 
obtained by writing any physical quantity A as as an 
operator A = ^ diPi, this is the usual spectral theorem. 

We emphasize that we do not need any additional 
“measurement postulate”, since measurement is already 
included in Axiom I, i.e. in the very definition of a 
modality (see detailed discussions in (la - H^ l. Quantum 
superposition are certainly there as usual, but they are 
not spooky “dead-and-alive” concepts: they are rather 
the manifestation of a modality (i.e., a certainty) in 
another context. Entanglement is also present as linear 
superpositions of tensor product states, corresponding 
to modalities in a “joint” context. In a two-particle 
Bell-EPR experiment [s^, the entangled modality 


is defined in a joint context (e.g., a singlet state for 
two spins), and it is incompatible with a separable 
modality corresponding to separate measurements. 
When a measurement is done on one side for one particle 
only, there is no influence or action at a distance, but 
the system (still not measured on the other side) may 
be quite far from the new context resulting from the 
partial measurement. Since a modality requires both a 
context and a system, one sees that it embeds non-local 
features, corresponding to quantum non-locality, but 
fully compatible both with relativistic causality and 
with physical realism [ 13 . 


The view about the “classical vs quantum” dilemma 
that emerge from our approach has been discussed in 
details in ITj . It does agree with physical realism, given 
that classical objectivity has been replaced by contextual 
objectivity [laEl! expressed by Axiom I. This Axiom 
takes from EPR their definition of “elements of physical 
reality” based on full predictability and reproducibil¬ 
ity, and from Bohr the idea that such a physical reality 
must include “the very conditions which define the possi¬ 
ble types of pre dictions regarding the future behavior of 
the system” [^, i.e., the context. Therefore physical re¬ 
ality does not belong any more to the system alone, but 
pertains jointly to the Gontext, System, and Modality 
(GSM). This approach allows one to distinguish clearly 
between the modality, which is basically a real physical 
phenomenon, or a physical event in the sense of proba¬ 
bility theory, and the projector, which is a mathematical 
tool for calculating non-classical probabilities. This point 
of view also provides novel answers to questions about the 
“reality of the wave-function”. 


To conclude, let us emphasize that we discussed a 
very idealized version of QM, based on pure states and 
orthogonal measurements. Nevertheless, this idealized 
version does provide the basic quantum framework, 
and connects the experimental definition of a physical 
quantity and the measurement results in a consistent 
way, both physically and philosophically El- Adding 
more refined tools such as density matrices, imperfect 
measurements, POVM, open systems, decoherence, is of 
great practical interest and use, but this will not “soften” 
the basic ontology of the theory, as it is presented here. 
The present work, deeply rooted in ontology, is thus 
complementary to many recent related proposals [iS3- 
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